Importance Sampling for a Monte Carlo Matrix Multiplication Algorithm, with Application to Information Retrieval
نویسندگان
چکیده
We perform importance sampling for a randomized matrix multiplication algorithm by Drineas, Kannan, and Mahoney and derive probabilities that minimize the expected value (with regard to the distributions of the matrix elements) of the variance. We compare these optimized probabilities with uniform probabilities and derive conditions under which the actual variance of the optimized probabilities is lower. Numerical experiments with query matching in information retrieval applications illustrate that the optimized probabilities produce more accurate matchings than the uniform probabilities and that they can also be computed efficiently.
منابع مشابه
Monte Carlo Simulation of Multiplication Factor in PIN In0.52Al0.48As Avalanche Photodiodes
In this paper, we calculate electron and hole impactionization coefficients in In0.52Al0.48As using a Monte Carlo modelwhich has two valleys and two bands for electrons and holesrespectively. Also, we calculate multiplication factor for electronand hole initiated multiplication regimes and breakdown voltagein In0.52Al0.48As PIN avalanche photodiodes. To validate themodel, we compare our simulat...
متن کاملA Stochastic algorithm to solve multiple dimensional Fredholm integral equations of the second kind
In the present work, a new stochastic algorithm is proposed to solve multiple dimensional Fredholm integral equations of the second kind. The solution of the integral equation is described by the Neumann series expansion. Each term of this expansion can be considered as an expectation which is approximated by a continuous Markov chain Monte Carlo method. An algorithm is proposed to sim...
متن کاملRandomized Approximation of the Gram Matrix: Exact Computation and Probabilistic Bounds
Abstract. Given a real matrix A with n columns, the problem is to approximate the Gram product AA by c ≪ n weighted outer products of columns of A. Necessary and sufficient conditions for the exact computation of AA (in exact arithmetic) from c ≥ rank(A) columns depend on the right singular vector matrix of A. For a Monte-Carlo matrix multiplication algorithm by Drineas et al. that samples oute...
متن کاملStopping-time resampling for sequential Monte Carlo methods
Motivated by the statistical inference problem in population genetics, we present a new sequential importance sampling with resampling strategy. The idea of resampling is key to the recent surge of popularity of sequential Monte Carlo methods in the statistics and engineering communities, but existing resampling techniques do not work well for coalescent-based inference problems in population g...
متن کاملA Simplified Proof For The Application Of Freivalds' Technique to Verify Matrix Multiplication
Fingerprinting is a well known technique, which is often used in designing Monte Carlo algorithms for verifying identities involving matrices, integers and polynomials. The book by Motwani and Raghavan [1] shows how this technique can be applied to check the correctness of matrix multiplication – check if AB = C where A,B and C are three n×n matrices. The result is a Monte Carlo algorithm runni...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- SIAM J. Scientific Computing
دوره 33 شماره
صفحات -
تاریخ انتشار 2011